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Abstract 

Caching the popular multimedia content is a promising way to unleash the ultimate potential of 
wireless networks. In this paper, we contribute to proposing and analyzing the cache-based content 
delivery in a three-tier heterogeneous network (HetNet), where base stations (BSs), relays and device-to- 
device (D2D) pairs are included. We advocate to proactively cache the popular contents in the relays and 
parts of the users with caching ability when the network is off-peak. The cached contents can be reused 
for frequent access to offload the cellular network traffic. The node locations are first modeled as mutually 
independent Poisson Point Processes (PPPs) and the corresponding content access protocol is developed. 

The average ergodic rate and outage probability in the downlink are then analyzed theoretically. We 
further derive the throughput and the delay based on the multiclass processor-sharing queue model 
and the continuous-time Markov process. According to the critical condition of the steady state in the 
HetNet, the maximum traffic load and the global throughput gain are investigated. Moreover, impacts 
of some key network characteristics, e.g., the heterogeneity of multimedia contents, node densities and 
the limited caching capacities, on the system performance are elaborated to provide a valuable insight. 
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I. INTRODUCTION 

The total mobile data traffic of 2020 will increase 1000 times compared with the 2010 traffic 
level [[2]l. Despite the deployment of the fourth generation Long Term Evolution (LTE) and 
LTE-Advanced systems, the rapidly increasing wireless data demands overwhelms the through¬ 
put increase that the wireless network could afford. Various innovative throughput-increasing 
methods have been investigated to tackle the ever-growing wireless data challenge, such as the 
heterogeneous network (HetNet) j3] and the cache-enabled content-centric network @]-[[6]]. The 
state of the art is elaborated in the perspective of the two aspects respectively in the following. 

HetNets, bringing the network closer to users: One widely regarded as the cornerstone 
technology is denser node deployment, including macro base station (BS), micro BS, pico BS, 
femto BS and relays. Such a HetNet decreases the distance between BSs/relays and users, 
and thus increases the area spectral efficiency, yielding the increase of network capacity IfTTl- 
|f9l . However, the exponential growth in traffic also requires the high-speed backhaul for the 
connection of different type of BSs/relays and content servers iflOl ifTTll . 

Cache-enabled content-centric networks, bringing the content closer to users: It has been 
shown that 70% of the wireless traffic is from multimedia contents, e.g., videos [21. Meanwhile, 
the multimedia contents are not accessed with the same frequency. Only a small fraction (5 — 10%) 
of “popular” contents are consumed by the majority of the users, and the less popular contents are 
requested by a much smaller number of users lfl2l . Moreover, following the uncannily accurate 
Moore’s law, a tremendous amount of computing and storage capacity is held by the intelligent 
terminal devices and networks. As such, the popular contents can be cached in BSs, relays and 
devices, bringing the content closer to users. It allows users to access to the cache-enabled nodes 
and reduces the duplicate content transmissions, mitigating the over-the-air traffic llT3l . 

Therefore, taking advantage of the caching capability within the wireless HetNet, the content 
diversity and network diversity can be exploited to relieve the burden of the fast growing traffic 

a, m, o. 

A. Related Work 

The role of the caching technology in the fifth generation (5G) wireless network is demon¬ 
strated in m, nia. Urs Niesen et al. investigate a large wireless caching network with the 
hierarchical tree structure of transmissions, and scaling results on the capacity region are derived. 
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An arbitrary traffic matrix and cooperative transmissions over arbitrarily long links are assumed 
m. lITbll introduces the distributed caching at the macro BSs to improve the network capacity 
and reduce the video stalling. The authors of IflOl advocate to set up relays with caching ability 
in the cellular network to reduce the access delay. The content placement scheme has received 
significant attention, e.g., lfT7l proposes a novel coded caching scheme to improve both the local 
and the global caching gain. l!T8l considers the scenario where a user in the overlapping coverage 
area can connect to any of the stations covering it. The optimal caching strategy maximizing 
the caching hit ratio is formulated by solving the Geographic Caching Problem. Optimal request 
routing and content caching are investigated in lfl9l to minimize the average content access delay. 
In ll20ll . the energy consumption is minimized by appropriately pre-caching popular contents. 
OTI studies how to disseminate the content via cellular caching and Wi-Fi sharing to trade off 
the dissemination delay and the energy cost. Based on the content popularity, the cache-based 
multimedia content delivery scheme is proposed and analyzed in @. Terminal users can share 
the received content via opportunistic local connectivity to offload the traffic of cellular links in 
|f22j. P3l exploits redundancy of user requests and the storage capacity of terminal devices via 
dividing the cell into virtual square grids. 

However, in the current research of caching, the assumption of global knowledge of the 
stationary network topology and the node connectivity graph is critical, and the regular grid 
network model is too optimistic and idealistic to fully capture the randomness and complexity 
of node locations in the HetNet nowadays. Different from the traditional system model, lots of 
researches have pointed out that the node location obeys PPP instead of regular hexagonal grid in 
realistic HetNet JTj, [HI, lf24l . Il25ll . Two tiers of BS locations are modeled as independent PPPs 
in lf26l where joint resource partitioning and offloading are analyzed in the HetNet. |[27ll studies 
the optimal node density in homogeneous and heterogeneous scenarios by modeling cellular 
networks with PPP. The authors in lf28ll model the node locations of the multi-tier HetNet as 
mutually independent PPPs, and analyze the system performance in terms of the average rate. 
[f29ll takes the limited backhaul into consideration to analyze the performance of the homogeneous 
cache-enabled small cell network, where the nodes of the small base stations are stochastically 
distributed. A constant service rate is assumed when the files can be found in the local cache 
and the downlink capacity exceeds the threshold. In OOl . disjoint circular clusters are scattered 
based on the hard-core PP. Requesting users and cache-enabled users are distributed with two 
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independently homogeneous PPPs. The requesting users obtain the content from cache-enabled 
users in the same cluster via the out-of-band device-to-device (D2D) in the cellular network. 

Furthermore, the traditional fetching and reactive caching methods doesn’t intelligently utilize 
the service characteristic such as the traffic redundancy and the content popularity. Few studies 
considers the scenario where the radio access network (RAN) caching and the D2D caching 
coexist. Meanwhile, the performance of the wireless cooperative caching HetNet is not yet 
fully investigated. How much performance improvement actually can be reaped via the caching 
technology is urgent to be answered theoretically. 

B. Contributions 

Towards these goals, in this paper we analyze the scheme that when the network load is off- 
peak, the most popular contents can be cached at the nodes via broadcasting. The BSs, relays and 
cache-enabled users are cooperative to transmit contents in the HetNet. The main contributions 
of this paper are summarized as follows: 

• We consider the limited caching ability of both relays and parts of the users. Popular contents 
are cached when the network is off-peak. Besides the cellular communication, there exists 
the local content sharing links from the cache-enabled user to the users. When a user triggers 
a request, it can be responded by BSs, relays or the cache-enabled users. 

• We model the node locations (BSs, relays and users) of the three-tier HetNet (BSs-users, 
relays-users, users with caching ability-users) as mutually independent PPPs. The content 
access protocol is then proposed, based on which the tier association priority is formulated. 

• We derive analytical expressions of the average ergodic rate and outage probability for users 
in different Cases. Then with the modeling of the request arrival and departure process at the 
service node as a multiclass processor-sharing queue, the throughput and delay of different 
classes are further analyzed based on the continuous-time Markov process. 

• We propose the steady ruler and the critical point for the HetNet to keep steady, according 
to which the throughput and the maximum traffic load over the entire network are then 
evaluated. Moreover, impacts of the cache-enabled users, content popularity and the limited 
storage capacity on the network performance are analyzed. 

The remainder of the paper is organized as follows: In Section II, we formulate the three- 
tier HetNet architecture and elaborate the tier association priority based on the content access 
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protocol. The average ergodic rate and outage probability are derived in Section III and Section 
IV. The performance gain in terms of the throughput and the delay are analyzed in Section V. 
In Section VI, numerical results are presented. Finally, we give our conclusions in Section VII. 

II. System model and Protocol description 

In this section, we first model the nodes of the three-tier HetNet as mutually independent PPPs 
with different densities. Then the cache-enabled content access protocol is described. Afterwards, 
the probability of the tier association priority and the state of users are derived. 


A. Network Architecture 


Consider a three-tier wireless HetNet consisting of a number of macro BSs, relays and users 
as illustrated in Fig. Q] The nodes of the z-lh tier (i = 0,2,3 for the users, relays and BSs, 
respectively) are deployed based on an independent homogeneous PPP -01 with intensity A, 0, 
fa,®. Note that in the practical system there are more users than relays or BSs, so we consider 
Ao A 2 > A 3 in this paper. There are N multimedia contents on the multimedia server, where 
all the contents are assumed to have the same size of S [bits]. Each of the relays has a limited 
caching storage with the size of M 2 x S [bits], but only a part (e.g., the 0 < a < 1 proportion) of 
the users has caching ability and the corresponding size is Mi x S [bits], and Mi <C M 2 <C N. 
According to Poisson processes, the locations of the cache-enabled users are distributed as a 
thinning homogeneous PPP with density Ai = oA 0 . 

It has been observed that people are always interested in the most popular multimedia contents, 
where only a small portion of the contents are frequently accessed by the majority of users Ifl2ll . 
The higher ranking of a multimedia content, the greater the requested probability. The popularity 
of the z-rankcd content can be modeled by the Zipf distribution as follow |0, lUIl - ETil 


fi = 


1/P 


( 1 ) 


where 7 > 0 reflects the skew of the content popularity distribution. The larger 7, the fewer of 
popular contents accounting for the majority of the requests. 


B. Cache-enabled Content Access Protocol 

A high-capacity wired backhual solution can be used for the connection link between BSs 
and relays, e.g., optical fiber. When the network is at low traffic load, e.g., the traffic load in the 
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Fig. 1. Cooperative caching in heterogeneous networks: The plot on the top right side is a snapshot of different nodes deployed 
with PPPs: BSs (red circle) are overlaid with relays (green circle). Parts of users (black circle) have caching ability, the others 
(black rectangle) do not. The structure of a typical BS cell is highlighted in the lower plot. 


nighttime, the most popular contents can be cached at the relays and the cache-enabled users 
via broadcasting lf29ll . All the cache-enabled users store the same copy of the contents until the 
caching storage is fully occupied, and those cached in different relays are also the same. 

When a user requests a multimedia content, it first checks whether the caching storage is 
available in its local devices. If the requested content is cached in its caching storage, the user 
can obtain the content immediately; otherwise, the user access the “closest” node. Here, we 
define the node providing the maximum received power as the “closest” node of a requesting 
user. The user’s received power is defined as 0, f27l . |[28l . 

Ci = vBiPirr?, ( 2 ) 

where Pi for i = 1,2,3 is the transmit power of the node in the /’-th tier. When i = 1 it means 
the user gets the content from a cache-enabled user via the local sharing link such as D2D 
1(61 ll23l lf30l lf3Tl considered in this paper. /3 > 2 denotes the path-loss exponent and r, is the 
distance between the requesting user and its closest node of the i-th tier, u denotes a propagation 
constant and is normalized as 1 in this paper. For clarity, the association bias B % of the z-th tier 
is assumed to be 1. Thus, the closest node is arg max t C,. As a result, there are four content 
access Cases in the three-tier HetNet as described below. 
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Case 1: The requesting user does not have caching ability, and it can successfully obtain the 
requested contents from the closest node (BS, relay or the cache-enabled user). 

Case 2: The requesting user has caching ability, but the requested content is not cached in 
its local caching storage, which means all the cache-enabled users do not store the requested 
content. Thus, only the closest node in the relay or BS tier can respond to the request. 

Case 3: The requesting user does not have caching ability, and its closest node is a cache- 
enabled user. However, the corresponding cache-enabled user does not cache the requested 
content due to the limited caching storage. So the requesting user needs to obtain the content 
from the closest node in other tiers, i.e., the relay or BS. 

Case 4: The requesting user has caching ability, and it can obtain the requested content from 
its local caching storage immediately. 

The HetNet without caching is considered as a baseline in this paper. In the baseline, neither 
users nor relays have caching ability. Therefore, BSs could not pre-broadcast the popular contents 
to the users and relays. And the local sharing links (D2D) among users can not work at this 
time. If the closest node of a user is a relay, the relay needs to fetch the content from the BS 
via wired backhaul firstly and then forwards it to the user; If the closest node is a BS, the BS 
responds the user’s request. Similarly, in the caching network, if the content is not cached in the 
relay, a Backhaul-needed ( BH-needed ) event happens and the relay needs to fetch the content 
from the BS via the backhaul firstly; otherwise, the Backhaul-free ( BH-free ) event happens and 
the relay can respond the request immediately without the backhaul. 

C. The Probability of the Tier Association Priority 

As described above, the locations of users, BSs and relays are modeled as mutually independent 
PPPs. Therefore, the probability that there are n nodes in area A with radius of r is given by 



(3) 


where n = 0,1,2,... and i = 0,1,2,3. Without loss of generality, according to Slivnyak’s 
theorem E4l . we conduct analysis on assumption that there is a typical user with or without 
caching ability at the origin of the Euclidean area, and it is regarded as the reference user. 

We first analyze the scenario where the reference user does not have caching ability with the 
probability of 1 — a. So the probability that the distance between the reference user and its 




closest cache-enabled user is larger than ry is 


P (y > ry) = P (Oini/ylry) = e 


- p -i r^irj 


(4) 


Therefore, the probability density function (PDF) of the distance from the reference user to the 
closest cache-enabled user is given by 

d (1 — P (y > ry)) 


fRiiri) = 


dr i 


= 2 vrAir 1 e- ,rAir i. 


(5) 


Similarly, the PDF of the distance from the reference user to its closest relay and BS are 

fn,i r i) = 27rAjrye -7rAir s % = 2,3, (6) 

respectively. As a result, the joint PDF can be given by 


/Ri,i? 2 ,-R 3 ( r 1 > r 2 , X — n 27tA; 


-7T X i V i 

e i=1 


(7) 


. i=l 


To derive the main conclusions in the following, we first consider the general A'-ticr HetNet 
with PPPs with parameters A* and P t , i = 1,2 , K. Denote C ti ,i = 1, 2, K, as the maximum 
received-power from the fj-th tier, where t t G {1, 2, K} means that the value of the maximum 
received-power from the tj-th tier is ranked /-th. We thus have the following proposition. 

Proposition 1: The probability of C L] > C h > ... > C, K is 


K -1 


P(Ct! >C t2 >■■■>C tK ) — ] 


n— 1 


K 


A; 


E tm 

aT 


p t 

Ln 

~P. 


-1 


( 8 ) 


Proof: See Appendix A. ■ 

For the three-tier HetNet of this paper, the probability of C t > Cf > Cf - 1 f ] f k G {1,2, 3} 
is then given by 
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P(Q > Cj > C k ) = 


1 I 


A,- V P 


2-1 —1 


hi ( El 

^ A v V Pi 


.3 = 1 


-1 


(9) 


We observe from Q that the reference user without caching ability prefers to obtain the content 
from the i-th, j-th, and A’-th tier in turn with probability of P(Q > Cj > 0,). Proposition |T| can 
be further extended to the following lemma. 

Lemma 1: The probability that the reference user without caching ability prefers to associate 
with the i-th tier at first in the A'-ticr HetNet is 


Gk,i = P {Ci > max C n ) = 

’ Mn+i 


K 


A m, ( Pf 


-1 -1 


E / 'ra / ^ rri 

X vx 


m= 1 


( 10 ) 












Proof: See Appendix B. 


So in the three-tier HetNet, the probability that the reference user without caching prefers to 
get the content from the i-th tier at first is 



( 11 ) 


Likewise, as to the scenario where the reference user is cache-enabled, the probability of 


Ci > Cj,i f j E {2,3} is 



( 12 ) 


Equation (fl2l) means that the reference user with caching ability prefers to obtain the content 
from the i-th, j-th tier in turn with probability of P(Q > Cj) when the requested content has 
not been cached in the local caching. P(Q > Cj > Ck) and P(C; > Cj) will be denoted as Py^ 
and Py respectively for convenience in the following. According to (ITTh and (fl2)) . we find that 
the tier association priorities are different when the user is cache-enabled or not. Users prefer 
to connect to the tier with higher transmit power and node density. 

D. The Density of the Active D2D Transmitters 

In the subsection above, we have analyzed the tier association priority merely based on the 
geographical locations, where the detailed impacts of the limited caching space and the content 
popularity are not considered. Define C E {Case 1, Case 2, Case 3, Case 4} as the Case the 
user may be active in. Let T E {Tier 1, Tier 2, Tier 3, Local} be the node where the user can 
obtain contents. Let W E {BH-needed, BH-free} describe whether the backhaul is needed for a 
user to access the content successfully. We only consider the wired backhaul between the BS 
and the relay, the impacts of the backhaul from the multimedia server to the BS are out of the 
scope of this paper. Denote x — (C, T, W) as the state of the user. Probabilities of different x 
are listed in Table [Q where we rewrite ff'fa fi as F{ a -> b) for simplification. Assign the value 
in the i-th i E {2, 3,..., 9} row j-th j E {3,4,..., 6} column of Table [J to the element 
of a matrix D 8x4 . 

Based on Table HI the probability that a user obtains the content successfully via the D2D 
link is <? 3 ,i(l— a)F(l, Mf), i.e., D\ i. So the density of users to be served by D2D transmitters 
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TABLE I 


Probabilities of that the user is active in different states. 


P [ X = (C,T,W)] 

Tier 1 (D2D) 

Tier 2 (Relay) 

Tier 3 (BS) 

Local 

Case 1 

BH-free 

d 3 ,i(l-a)F(l,Mi) 

g 3 ,2(l-a) F(1,M 2 ) 

03, 3(1 —a) 

0 

BH-needed 

0 

g 3 .2(l-a)F(M 2 +l,N) 

0 

0 

Case 2 

BH-free 

0 

M 2 ) 

¥ 3t2 aF(M 1 + l,N) 

0 

BH-needed 

0 

^ > 2. 3 otF(M 2 -\-l, N) 

0 

0 

Case 3 

BH-free 

0 

Pi, 2 . 3(1 — a)F[M\-\-l, M 2 ) 

Pi,3,2(l-a)F(Mi + l,A0 

0 

BH-needed 

0 

Pi,2,a(l-a)F(M 2 +l,iV) 

0 

0 

Case 4 

BH-free 

0 

0 

0 

aF(l, Mi) 

BH-needed 

0 

0 

0 

0 


(TXs) is Ao<? 3 ,i( 1 — a)F(l, Mi). However, the density of the cache-enabled user is A 0 a, which is 
the maximum density of D2D TXs. Define A' as the density of the actually active D2D TXs. If 
a small fraction of users have caching ability, in the coverage of a cache-enabled user, there is 
at least one user to be responded via the D2D link, i.e., the density A' x is aA 0 . At this time the 
number of D2D links are limited by the number of cache-enabled users. All of cache-enabled 
users should be active as D2D TXs to satisfy the demand for the D2D link. However, if most of 
users are cache-enabled, some cache-enabled users may not cover any user in the corresponding 
coverage. The density of cache-enabled users active as D2D TXs is A' x = (1 — a)A 0 l? 3 ,ii ? (l, Mi), 
which is smaller than o Ao. At this time the number of D2D links are limited by the number 
of the users without caching ability, and not all of cache-enabled users are active as D2D TXs. 
Thus, the node density of the active D2D TXs can be given by 

X[ = min {«Ao, (1 — ct)Aol? 3 ,iT 1 (l, Mi)} . (13) 

We define a* as the critical point deciding whether all of cache-enabled users need to be 
active as D2D TXs. Let aA 0 = (1 — a)X 0 Q 3t i J2i=i fi we § et the critical point, 

a* = max { 0 , [F(l, M,) - h] [1 + F( 1 , Mi)}- 1 } , (14) 

where h = =2 ■ From (fl4l) we observe that whether all the cache-enabled user need to 

be active as D2D TXs is jointly decided by the user caching ability (Mi), the content popularity 
( 7 ), the transmit power (Pi), the node density (A,) and the path-loss exponent (3). a* increases 
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with the increase of M 1; 7 and 6. Then equation (fl3l) can be rewritten as 


aAo, a < a*; 

(1 -a)XoG 3 ,iF(^,Mi), a — a * ■ 


(15) 


Next, we introduce another variable a which decides the maximum density of the D2D TXs in 
the network with various a. Based on the first derivative of (1 — a)Ao^ 3 ,iT 1 (l, Mi) with respect 
to a we can get a = \/h 2 + h — h. The density of the D2D TXs increases with a in the region 
[0, a] and starts to decrease from a. It implies at most a D2D links can be set up in unit area. 

For the other two tiers, as considered above that all the relays and BSs are fully loaded and 
active when A 0 A 2 > A 3 , the actually active node density of the BSs and relays equal to the 
corresponding node density, i.e., A' = Aj for i = 2, 3. Therefore, the nodes of the actually active 
D2D TXs, relays and BSs are scattered according to mutually independent homogeneous PPPs 
<l>j, i = 1, 2, 3 with the density A', respectively. 


III. The Average Ergodic Rate 


The average ergodic rate in the downlink is analyzed in this section. Specifically, the com¬ 
munication link between the relay/user and the requesting user is assumed to share the same 
frequency with that from the BS to the users, yielding the interference. There exist two types of 
interferences, namely, the inter-tier and the intra-tier interference. The full load state of the BS 
and relay is considered and user requests arriving at the same service node are responded one 
after the other in a round-robin manner Q. We shall note that the rate analyzed in this Section 
refers to that over the air, and the effect of the backhaul will be considered in Section lYl 
Therefore, the signal-to-interference-plus-noise ratio (SINR) of the reference user associated 
with the node in the f-th tier is 


SINRj(a;) 


Pi9i,ox 


-0 


PiQiflX 13 A P t g t nX /5 


E E Pj h jk\Yjk\-P + E / i + 0 ' 2 


(16) 


j =1 fce^VB^o i= l 

where a 2 denotes the power of the additive noise, x is the distance between the reference user 
and its serving node, fjk.o and hj k denote the channel power gain. Here, we consider Rayleigh 
fading channels with average unit power, yielding g k)0 ~ exp(T), h. )k ~ exp(l). \Y jk \ is the 
distance between the reference user and its interfering nodes k in the j-th tier. Ij denotes the 
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cumulative interference from the y-th tier. We define the average ergodic rate U ,. i = 1,2,3 of 
the reference user when it communicates with the i-th tier as m, m, Ea, m, Esi, 

U, = E a ,[E SINR Jln(l + SINRj(x))]]. (17) 

Here, the unit of the average rate is nats/s/Hz (1 nat = 1.443 bits) to simplify the analysis. The 
average is taken over both the channel fading distribution and the spatial PPP. The ergodic rate 
is first averaged on condition that the reference user is at a distance x from its serving node in 
the i-th tier. Then the rate is averaged via calculating the expectation over the distance x. The 
metric means the average ergodic rate of a randomly chosen user associated to the 2 -th tier. 


A. The Average Ergodic Rate in Case 1 

Denote X, as the distance between the reference user and its serving node of tier i. Based on 
the proof in 11281 . we can obtain the PDF of X, as follow, 


, 2 vrA 4 — £ 
fx t (x) = ~^xe J- 1 

r/3,z 


(18) 


Then we have the following theorem. 

Theorem 1: The average ergodic rate of the reference user associated with the i-th tier (i = 
1, 2, 3) in Case 1 is 




27tA^ 

G:i,i 



0 JO 


xexp < —x^P i 1 (e t — l)cr 2 


7rA iX 
Qi.i 


2 r 


14 


Ai + (a / 1 -a 1 )^ 3 ,i 


AiZfV-l) 


dfdx. (19) 


Proof: See Appendix O ■ 

Since the node densities are typically quite high in the HetNet, the background noise is far 
smaller than the interference power. The interference is dominant and the noise can often be 
neglected, i.e. (a 2 —* 0), then the rate is further simplified to 

1 


U u = 


f ° i+ AiZfV-1) 


■d t. 


( 20 ) 


According to dT5l) . equation (l20l) can be further rewritten as 

POO 


a < a*; 


Mi ,i — < 


Mi 


1 + ( 1 + “^^3,1 E/i- ^3,1 ) ^i(e*-l) 

i= 1 


-1 -1 


( 21 ) 


d t, a > a* 
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Equations ( l20l) and (I2TI) reveal that when the interference is dominant, the average ergodic 
rate in Case 1 is independent on which tier the user connects to. Moreover, from (1211) we 
know that, when the interference is dominant and the fraction of the cache-enabled users is 
small (i.e.,a < a*), the average ergodic rate keeps constant. On one hand, the rate keeps 
constant when a varies in the region of [ 0 , a*] such that all of the cache-enabled users should 
be active as D2D TXs. Higher density of D2D TXs gets the content closer to users while 
adding additional interference caused by the increase of the D2D pairs. On the other hand, 
the rate keeps constant independently on system parameters such as the transmit power P. L and 
node density. This means that raising the transmit power or service node densities increases 
the desired signal power and the interference by the same amount, and they offset each other. 
However, the parameters affect the number of simultaneously active nodes in unit area. As an 
example, with larger a and caching ability Mi, more users can get contents via the D2D link or 
immediately from their local caching, yielding the change of the sum rate of the cache-enable 
network. However, when a > a*, the average ergodic rate increases with the increase of a based 
on (1271) . It is because the distance between the user and the D2D TX is reduced with larger 
number of cache-enabled users, but not all the cache-enabled users need to be active as the D2D 
TXs at this time, breaking the balance between the desired signal power and the interference. 

B. The Average Ergodic Rate in Case 2 

Similar to (fl 8 l) . the PDF of the distance between the reference user and its serving node of 
tier i in Case 2 is 



( 22 ) 


We then calculate the average ergodic rate for Case 2 as follow. 

Theorem 2: The average ergodic rate of the reference user associated with the i-th tier (i = 
2, 3) in Case 2 is 



Proof: See Appendix [Dl 


( 23 ) 
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When the interference is dominant, i.e., a 2 —» 0, we obtain 


3( 2 A ~ 


dt. 


(24) 


U 2i = 


(25) 


0 1 + i?i(e f -l) + ^T=^Z 2 (a) 

According to (IT5l) . (l24l) can be rewritten as 

POO 

/ --dt, a < a* 

lo l+2i(e*-l)+ T ^-^(a) 

/•°° r Mi n -1 

/ 1 + -2i(e* —1) + Q! / 1 _ e 3 ’) X] /i-2 2 (a) dt, a > a*. 

/o L ’ i=l 

Compared (1251) with (12TT) . we find that the average ergodic rate of Case 2 is smaller than 
that of Case 1. D2D TXs bring out additionally unnecessary interference to the users in Case 2, 
decreasing the rate. According to (j25jl . when a < a*, the average ergodic rate in Case 2 decreases 
with the increase of a because Qz,\ increases with the a. Furthermore, set the first derivative of 
U 2 ,i with respect to a to zero, we can get a critical point a, which exactly is the point getting 
the maximum number of active D2D TXs explained in subsection III-DI The number of active 
D2D TXs increases monotonically with the increase of a when a < a; otherwise, it decreases 
monotonically when a > a. More active D2D TXs lead to more unnecessary interference to the 
users of Case 2. Therefore, the rate continues decreasing with the increase of a in the region 
[a* , a] but it starts to increase from a. On the whole, the rate in Case 2 decreases in the region 
[0, a] and then increases from a. Any of the network parameters such as the node density (A*), 
the content popularity (7), the transmit power ( Pi ), the path-loss parameter ( 3 ) and the caching 
ability (Mi) can affect the trend of the rate in Case 2. 


C. The Average Ergodic Rate in Case 3 

Based on the definition of Case 3, we have C\ > Cj > C k , ( 3 , k) e {(2, 3), (3, 2)}. As 
described above, Xi is the distance between the reference user and its closest cache-enabled 
user. Let Yj be the distance between the reference user and its closest node in the j-th tier for 
j = 2, 3. Then the joint PDF of x, y in Case 3 is 


fx 1 ,Y j (x,y) = 


4:7T 2 \i\jxy 


Pi 


j,k 


exp 


-itXiX 2 — nX ~y 2 


_ At. Pb, 1 

+ T ( p i )f 


, lf y>(-^-) p x. 
Pi 


(26) 


If y<{^) p x, fx, y.j (xry) = 0. The proof is derived in Appendix [0 As a result, we obtain the 
following theorem, 
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Theorem 3: When the interference is dominant, the average ergodic rate of the reference user 
associated with the j-th tier (j — 2, 3) in Case 3 is 

2 x( 1 -£ 3 ,i )- 1 


OO pi 


^3 ,j - 



°‘ , o|l + z 1 ( e .-l) + g£ 


l+|2 3 (e‘-l) 


rdxdf. 


(27) 


Proof: See Appendix [F| ■ 

We can observe that when the interference is dominant, the average ergodic rate of the 
reference user in Case 3 is also independent on which tier the user connects to. Furthermore, 
denote Ui as the average ergodic rate of the reference user in Case 4. Ui shall be considered as 
a extremely fast speed with which the user can read out the contents from its local caching disk 
immediately. The higher the content popularity ( 7 ) and caching ability (Mi) become, the higher 
probability there is for users to be active in Case 4. 


IV. The Outage Probability 

Besides the average ergodic rate elaborated in the previous section, we will derive another 
important performance metric, i.e., the outage probability in this section. The outage probability 
can be defined as the probability that the instantaneous SINR of a randomly located user is less 
than a threshold r. Let Vi be the average outage probability of the reference user associated 
with the Ath tier, which can be expressed as 10, I®, (25l . If26l . (28], (29], 

Vi = E[P[SINIC(x) <t]]. (28) 


The metric can be equivalently interpreted as the average fraction of the cell area where the 
receiving SINR is smaller than a specific threshold. It is also exactly the cumulative distribution 
function (CDF) of the SINR over the entire network. The outage probabilities of the different 
Cases are analyzed in the following. As to Case 1, we have the following theorem. 

Theorem 4: The average outage probability of the user connected to the i-th tier (1 = 1, 2, 3) 
in Case 1 is 


2 rrXi I P 

Via = 1 ~——/ xe 


xPPt irXiX 2 Yl + (*i-*l)ff3,l 

A i 2 r 1 u> J d X ' 


63 ,i 


JO 

Proof: See Appendix 0 

For the special scenario where the interference is dominant, i.e., a 2 —> 0, we have 

Ai + (A) — Ai)f? 3 j i ’ 1 


(29) 


V u = 1 - 


1 + 


Ai Zf\r) 


(30) 
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Theorem 5: The average outage probability of the user connected to the i-th tier (i = 2, 3) 
in Case 2 is 


V 2 a =1 


2 ir\i 

■pr 


xexp <! — 1 rcr 


7tA iX 2 


- fl 2a 2 P h 1 2.0 2 . 

1 - p,Z -g, 


h3 JO 

where Z 2 (a) = r 

Proof: See Appendix |H1 
When the interference is dominant, we get 

V 2 a = 1 - 


IV 


l + ZAr) 


a; Qk 


2 1 
p~ a 21 


A i 1 — ^3,1 

and a is as small as 0. 


Zo(a) 


dx, (31) 


1 + Z!(r) + ^j^^Z 2 (a) 

Theorem 6: When the interference is dominant, the average outage probability of the user 
connected to the j-th tier (j = 2, 3) in Case 3 is 

2x(1-0 3 ,i)' 1 


(32) 


^■ = 1 - 


1+2 ‘ M+ftgr 


i+D 


r dx, 


(33) 


where Z 3 (r) = jf^x ^ 2 F 1 [1,1 - ■§; 2 - §; -r.x ^]. 


/?’ 


Proof: See Appendix HI ■ 

From Theorems 01 [5] and [6] we see that, similar with the average egodic rate, the average outage 
probability is not affected by which tier the user connects to when the interference is dominant. 
As to users in Case 1, when the fraction of cache-enabled users is small, the interference and 
the desired signal power change by the same amount with the change of the transmit power or 
the node densities. However, the unnecessary interference triggered by D2D TXs depraves the 
outage probability in Case 2 and Case 3. Moreover, let V/ be the outage probability of the user 
in Case 4, which is as small as 0 because of the immediate reading from the local caching disk. 


V. The Throughput and The Delay 

We have analyzed the performance metrics from the perspective of a single user. Based on the 
analysis results of previous sections, the throughput of the entire network will be derived in this 
section. The delay and the critical condition for the network to keep steady will be elaborated. 

We conduct analysis in a typical BS cell. According to the PPP model for the node locations, 
the average number of users in a typical BS area is A f[28Tl . We now introduce the traffic dynamics 
of request arrivals and departures. Requests of Aa users are considered as a unified event and 














17 


modeled as a Poisson process with parameter ^ [requests/s], i.e., the request interarrival times are 
exponentially distributed random variables with mean 1 seconds [[5]] ll32l . It implies that requests 
of a single user is a Poisson process with parameter ^ [requests/s]. The arriving requests require 
to access some sets of contents. Volumes of the sets are independent exponentially distributed 
random variables with mean 1 [contents/request], and the request interarrivals and request sets 
are independent 0 |[32j We define e as the total request arrival rate and a = ^ as the total 
traffic demand (in [bits/s]) in the typical BS cell. 

Based on Table HI each element of matrix D represents the probability that a randomly chosen 
user is active in the state of x = (C, T , W). Therefore, (i,j) for z = 1,2,..., 8 and j = 1, 2,..., 4 
can represent the state of the user with the mapping g : ( C,T,W ) —>■ (i,j). Then the density 
of users in the state of (i,j) is X l , J = A 0 Aj- Corresponding to the consideration in Section IHfl 
that service nodes are in the full load state, in this section, BSs, relays and D2D TXs without 
user being served are assumed to make dummy transmissions which bring interference to others 
as well lf33fl . Consider w Hz bandwidth are shared among different tiers. Let element A i3 of 
matrix A 8X 4 denote the average ergodic rate of the user in the state of (z, j). A is generated by 


{ A 2m —l,j — l,j 7 ^ 0)> fot" Tfl — 1,2, ...,4 J 

A 2 m,j = r]wf{U m j)l( y D 2mtj ^ 0 ), for m = 1,2, ...,4, 

where l(-) is the indicator function and z/ = 1.443 is the conversion factor between [nats] and 
[bits]. U m j is the average ergodic rate analyzed in Section [111] and A, 4 = A- We consider U 2> i, 
A,i and W 4j - (for j = 1, 2, 3) as 0 just like we define the matrix D even though no user is 
active in these states, and these virtual variables are defined to simplify the description. Due 
to the delay caused by the additionally wired transmission process and the limited backhaul, 
we assume the users can get the content with the service rate of / {U m ,j ) when the backhaul is 
needed, which is a function of U m j and is smaller than U. m j . 

In the coverage of a D2D TX (a relay, a BS, a cache-enabled user itself), the average number 
of users who are in the state of (i,j) is riij = 11281 for j = 1(2, 3,4) and z = 1,2,..., 8 , where 

A ' 4 = aA 0 . Up to now, we can divide users associated to a D2D TX (a relay, a BS, a cache- 
enabled user itself) into 8 classes based on the j-th column of the matrix D. The corresponding 
class request arrival rate and class traffic demand are respectively Q t . 3 = and a, :j = 

for j = 1(2, 3,4) and % — 1,2, ...,8. Similarly, let x h j be the number of requests in class i of 
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the queue at a D2D TX (a relay, a BS, a cache-enabled user itself) for j = 1(2, 3,4). And let 
Xj = x 8 .j) be the vector counting the number of user requests in each class. The 

orthogonal transmission is assumed, where user requests arriving at a service node are served 
one after the other in a round-robin manner with equal portion of time. We may view a service 
node as a processor, where 8 classes of user requests with different arrival and service rates are 
queueing to be served. So the request arrivals and departures at a D2D TX (a relay, a BS, a 
cache-enabled user itself) can be regarded as a multiclass processor-sharing queue. 

Denote D := {1,2, ...,8} as the set of classes. Let the process {Xj(t);t > 0} describe the 
number of user requests in different classes of the queue at a D2D TX (a relay, a BS, a cache- 
enabled user itself) at time t for j = 1(2, 3,4). Then X 3 (t) has discrete state space N B and is a 
continuous-time Markov process which can be generated by: 


q( Xj , Xj + Ei) = &J, x 3 G N B 

q(xj, Xj - Ei) = , Xj G N B , Xj > 0. 

where e % represents the vector of N B whose v-th element is 1 and 0 elsewhere. = ^ ieD 
represents the total number of users in the queue. As to a steady network, the number of the 
requests leaving and arriving at the cell should be equal in the long run, i.e., the throughput is 
equal to the traffic demand. Define the throughput per request as the ratio of the given throughput 
(i.e., the traffic demand) by the mean number of user requests for a steady system lt32l . Then, 
Proposition 2: The mean number of user requests, the throughput per user request (Thr./Req.), 
and the delay of the i-th class (i = 1, 2,..., 8) at a D2D TX (a relay, a BS, a cache-enabled user 
itself) for j = 1(2, 3,4) are respectively given by, 


Nij = 


a, 


hi 


2 _ 

a c,j 


A 


T ■ = 


hi 


Vi 


v, 


C ,1 


A 


hji 


D i,j = 


\ — 3 

a c,j 


■Aij qS 


-1 


(36) 


where aj = , a t j can be considered as the total traffic demand in the queue at a service 

node. <j c j = — aj —t is a critical value such that the queue will be at the steady state when 

(Tj < ( 7 C j. And the mean number of user requests, the Thr./Req., and the delay in the queue at 
a service node respectively are, 


N 3 = 


Vi 


a c j <7j 


Tj — a c j aj, 


- aj 

Dj =- - -—, 

\ a c,j — Vj)Cj 


(37) 


where Q = is considered as the total traffic arrival rate of the queue at a service node. 









19 


Proof: The results can be deduced from |[32ll and the references therein, and the proof is 
omitted in this paper to avoid the unnecessary repetition. ■ 

Theorem Q] indicates that when the interference is dominant and a < a*, the service rates keep 
constant despite the change of the density of the BS/relay/D2D. It may lead to a misconception 
that the infrastructure can be deployed as scattered as possible. However, Proposition [2] reveals 
the critical condition to keep the system steady, i.e., Oj < o c ,j , Vj = 1, 2,..., 4 should be satisfied 


when arranging the network. We call -p- steady ruler of the network. The critical condition 
decides the maximum load/throughput of the system, e.g., the maximum arrival rate (<,*) of the 
request, 


S = max 





(38) 


Besides, Proposition [2] points out the maximum ratio of ¥-,i — 1,2,3 for the network plan- 
ning. Apparently, smaller densities of network infrastructures means more user connection per 
BS/relay/D2D, which will destruct the steady state of the queue and lead to the request conges¬ 
tion. 

Analyzing Proposition [2] we observe that, larger content size S, arrival rate c, and 1 are not 
helpful for the improvement of the performance. Higher service rate (Ai,j) is important for the 
smooth departure of requests, yielding the performance improvement in terms of the Thr./Req. 
and delay. Moreover, the network performance highly depends on the number of users in each 
class (rijj), which are determined by the transmission power (P,), node density (A,), content 
popularity ( 7 ), caching ability (Mi, M 2 ) and association protocol. Specifically, when the user is 
able to obtain contents from its local caching immediately, the Thr./Req. (the delay) tends to 
infinity (zero) for the fact that the value of A 7 4 = Ui is extremely high. 


VI. Numerical Results 

In this section, we simulate the cache-enabled network to verify the performance of the 
proposed system. We obtain the results with Monte Carlo methods in a square area of 2000m x 
2000m, where the nodes are scattered based on independent homogeneous PPPs with intensi¬ 
ties of {Ao,A 2 ,A 3 } = {7H2, 75§ o 2 t , 75^} nod es/m 2 . The transmit powers are {Pi,P 2 ,P 3 } = 
{23, 33,43} dBm and 20 MHz bandwidth are shared among different tiers. We set the path-loss 
(5 = 4, total number of contents N = 200, the size of each content S = 100 Mbits, the caching 
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Fig. 2. The subfigure (a) illustrates the probability of users associated to different service nodes; the subfigure (b) illustrates 
the probability of users active in different Cases; a = 0.1. 


ability Mi = 5 and M 2 = 50, and the content popularity 7 = 0.8. These typical parameters do 
not change unless additional statements are clarified. 

As illustrated in the subfigure (a) of Fig. [2l the probability for the user to obtain contents from 
the D2D TX or its local caching space becomes higher with the increase of 7. Smaller fraction of 
users need to access the relay (Relay-total in the figure) or BS with more “concentrated” contents. 
It implies that cache-enabled network reduces the cell load of the BS tier and the relay tier. The 
number of users accessing content from the caching space of relays (Relay-cache in the figure) 
increases first and then deceases because of the traffic offloading ability of the D2D tier. In the 
subfigure (b), the probabilities of Case 1 and Case 4 increase with 7 as more contents can be 
obtained via the D2D link or from the local caching, yielding the increase of the cache hit rate. 

The theoretical estimates and simulating results of the the average ergodic rates in Case 1-3 
are illustrated in Fig. [3] and they are consistent well. We obtain a* = 13.78% and a = 21.96% 
with the parameters in sub figure (a) based on (fl4l) . The rate in Case 1 keeps constant when 
a changes in the region [0, a*] and it starts to increase obviously from a*. It highly depends 
on whether all of the cache-enabled users are active as D2D TXs. We observe that the number 
of active D2D TXs increases linearly with a when a < a* as all of the cache-enabled users 
need to be active as D2D TXs. Only a part of cache-enabled users are active as D2D TXs when 
a > a* and it comes to the maximum number when a = a. As to Case 2, the rate decreases 
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(a) (b) 


Fig. 3. The average ergodic rates of different Cases: the transmit powers of the subfigure (a) are { Pi,P 2 ,P 3 } = 
{23,33,43} dBm, those of the subfigure (b) are { Pi , P 2 , P 3 } = {13,33,43} dBm. The left and right ordinate (black and 
red) respectively correspond to the average ergodic rates and the number of active D2D TXs. 


with the increase of a in the region a < a owing to the increase of D2D TXs, then it increases 
slightly after a for the number of active D2D TXs goes down. The rates in Case 2 and Case 3 
are smaller than those in Case 1. It is because the users of Case 2 and Case 3 can not obtain any 
benefit except for the unnecessary interference from D2D TXs. With the parameters in sub figure 
(b), we get a* = 0 and a = 31.62%. It means the transmit power and coverage of the D2D link 
are limited, thus only parts of cache-enabled users are active as D2D TXs for an arbitrary a. 
The number of active D2D TXs increases with the increase of a when a < a. Consequently, the 
rate of Case 1 increases but those of Case 2 decrease with the increase of a in sub figure (b). 

The outage probabilities of different Cases are demonstrated in Fig. [4] The outage probability 
decreases with the decrease of SINR target r, where lower SINR target means more interference 
is allowed. Similar to the average rate, the outage probability of Case 1 keeps constant before 
a goes to a* and then decreases obviously. Case 2 and Case 3 have higher outage probability 
compared with Case 1. Fig. 0] can be explained from another perspective with Fig. [5] where 
the CDF of the SINR are demonstrated. As an example, for SINR = —10 dB in Fig. [5] the 
value of the CDF of Case 2 is smaller than that of Case 3 when a = 0.05, while the former 
approximately equals to the latter when a = 0.1. It conforms to what is illustrated in Fig. |U 
Moreover, both 0.05 and 0.1 are smaller than a* = 13.78%, so the CDF of SINR for Case 1 
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0.1 0.2 0.3 0.1 0.2 0.3 

Fraction of cache-enabled users:a Fraction of cache-enabled users:oc 
(a) (b) 


Fig. 4. The outage probabilities of different Cases: the SINR threshold of the subfigure (a) and (b) are r = — lOdB and 
t = — 5dB, respectively. 



(a) (b) 


Fig. 5. The CDF of SINR for different Cases: the fraction of cache-enabled user in the subfigure (a) and (b) are a = 0.05 
and a = 0.10, respectively. 


when a = 0.05 coincides with that when a = 0.1 in Fig. [5j 

Fig. [6] compares the Thr./Req. of the cache-enabled network with that of the baseline. As in 
the subfigure (a), the Thr./Req. of class 1 (class 3) at the BS is higher than (approximately equals 
to) that in the baseline. Meanwhile, because of the strong unnecessary interference triggered by 
the D2D TXs, class 5 has lower Thr./Req. than that of the baseline. The other virtual classes 
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whose Thr./Req. are zero are not illustrated in the figure. Similarly, The performance of the j-th 
(j = 1, ...,4) class at the relay is better than or approximately equals to that in the baseline, 
while class 5 and class 6 get worse. Additional process is needed for the relay to fetch the 
uncached contents via the wired backhaul link, so the Thr./Req. of class 4 (class 6 ) is smaller 
than that of class 3 (class 5). In Fig. | 6 ] (c), we compare the Thr./Req. at the D2D TX with that 
at the BS in the baseline. The Thr./Req. of D2D TX outperforms that of BS in the baseline by 
46.8%-58.1%. D2D TXs give rise to interference, yet at the same time traffic loads of BSs and 
relays are offloaded by D2D TXs and the caching resources. Consequently, the Thr./Req. in the 
queue at the relay and BS are not seriously affected by the interference, while the throughput 
over the entire network increases significantly because of the increase of the number of the 
simultaneously active nodes and the tolerable request arrival rate. 

We present the steady ruler versus the request arrival rate in Fig. [7] to evaluate the throughput 
gain of the network. Based on the value of the steady ruler — for different typical queues, we 

a c,j 

circle out the critical point for the network to keep steady. From the figure we can see that the 
steady ruler of the relay and the D2D are far smaller than that of the BS. So the maximum load 
of the network, e.g., the maximum request arrival rate c,*, is decided by the state of the queue at 
the BS. The BS has a wider range of the coverage compared with that of the relay and the D2D 
owning to the higher transmit power. Most of users are covered by the BS and join in the queue 
at the BS. According to the critical point, we observe that when 7 = 0.8 (1.8) the throughput 
gain over the entire network is 13.3% (57.3%) compared with that of the baseline. Moreover, 
the steady ruler of the relay and the D2D is smaller than 1, so the relay can be deployed in the 
high-density area, and more opportunity should be given to the user to access the content via 
the D2D link, yielding the efficient offloading of the cellular traffic. 

For further discussions, we divide the time into slots with equal duration for content trans¬ 
mission. Requests with different volumes of contents are responded with corresponding rate in 
several slots. In the simulation, we choose 500 slots of them to investigate the number of user 
requests at the D2D TX in each time slot, based on which the average number of user requests 
during the 500 slots are also illustrated in Fig. [ 8 ] We observe that the average number of user 
requests in the simulation is lower than that of the analysis as dummy transmissions are assumed 
in the analysis. More precise analysis can be a promising topic for the further work and the 
analysis result in this paper is a lower bound of the performance for the cache-enabled network. 
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Fraction of cache-enabled users: a 
(c) 


Fig. 6. The subfigure (a), (b) and (c) respectively illustrate the Thr./Req. at the BS, relay and D2D TXs: {Pi, P 2 , P 3 } = 
{13, 33, 43} dBm, {A 0 , A 2 , A 3 } = {nodes/m 2 , c = 0.25, g = 1. 



Fig. 7. The throughput gain for the cache-enabled network compared with that of the baseline: {Pi,P 2 ,Pi} = 
{13, 33, 43} dBm, {A 0 , A 2 , A 3 } = { Jg*, Jgr, ^} nodes/m 2 , c = 0.25, g - 1, a = 0.25. 


VII. CONCLUSION 

The paper aims to model and evaluate the performance of the wireless HetNet where the RAN 
caching and D2D caching coexist. The caching ability is available in both the relay and some of 
the users. We propose to cache the most popular multimedia contents via broadcasting during off- 
peak time to be reused for frequent access. Firstly, we model the node locations of the HetNet as 
mutually independent PPPs. According to the maximum received-power cell association scheme, 
users can flexibly connect to the cellular and D2D link. Users are classified into four Cases 
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Fig. 8 . The number of user requests in the queue of the D2D TX: {Pi,P 2 ,P 3 } = {13,33,43} dBm, {Ao, A 2 , A 3 } = 

{Jp, T 5 §o 2 -} nodes/m 2 , ? = 0.25, g = l,a = 0.25, slot-step = 0.2 second. 


according to whether the requesting user is cache-enabled and the type of the service node. We 
theoretically elaborate the average ergodic rates and the outage probabilities of different Cases 
in the downlink. Based on the Case the user is active in, the user requests arriving at a D2D TX 
(relay, BS, cache-enabled user itself) can be classified into different classes. The throughput and 
the delay of different classes are then derived with modeling the multiclass processor-sharing 
queue and the continuous-time Markov process. We further provide the steady ruler for the 
HetNet, which decides the maximum traffic load/throughput of the network. Numerical results 
show that the global throughput of the cache-enabled system can increase by 57.3% compared 
with that of the system without caching ability. 


Appendix 

A. Proof of Proposition [7] 

With extension of ([7]), the joint PDF of the distance R\. IR ■..., Rk is 
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The Euclidean region Q satisfying C tl >C t2 >... >C tK _ 1 > C, K is 

0 < r tl < Too, 


f) = < 


" n 1 < r t2 < Too, 


(40) 


Ptr. 


Pt f 


r tK < r t < Too. 


So the probability of C tl > C t2 >... > C tK _ l > C tK is 


^(C tl > c t2 > ...> C tK _ 1 > C tK ) — 



fRt v Rt 2 , • • ;Rt K ( r ti ,rt 2 ,-,r tK ) dry x dr A _ 1 ...dry 



K 


= n 


(a) 


K -1 




n =1 


if 


E A^ / Pt. 
\ t 

^ L n 


, (41) 


where (a) follows when integrating with the region of 0, and the proof is completed. 


B. Proof of Lemma [7] 

In this Case, we only need to ensure that Ci is higher than that of any other tiers, and the 
order of the other tiers need not to be cared. So we have 


Qk:, i — P (Ci > max C n ) — P(C( > Ci,..., Ci > Ci— i, Ci > C^+i,..., Ci > Ck ) 



fRi,...,Ri<(p i) ...,r A )dr A dr A _ 1 ...dr 1 = 


K „ .2 

A. P, 
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. A*. \ Pt 

rn=j J N J 
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where the integral region now is 


(I = 


0 < ry < Too, 

p. \ 0 

-jf) Ti<r i < Too, 


P / ry < ry_i < Too, 
p i± i\ p r . < r . +1 < +OC; 


(^) ^ <r K < Too. 


(42) 


(43) 


Then we get the lemma. 
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C. Proof of Theorem 1 

According to (fT71) . we have 

Ui,i= / EsiNRi ln(l + SINRj(x)) x f Xi {x)dx. 

Jo L 

Because of E[Af] = f^°P(X > t)dt when X > 0, we have 
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3 = 1 

where Step (a) follows the fact that are mutually independent PPPs. Here, the interference 
comes from the actually active nodes in the z-th tier with density A' for i = 1,2,3. So the 
Laplace transform Cj. \:c 3 Pf l (eJ — 1)] is 


C,, [x‘>p- 1 (e‘ - 1)] = E,, 
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where Zj = x(^)l is the distance between the reference user and its closest interference node. 
Meanwhile, by using change of variables with u = [x^^-(e* — 1 )]~^y 2 , we have Step (a). In the 
expression above, we use Z 1 (e t —1) = 2 F 1 [1 ,1 — 2 — 1 — P], where 2 Fi[-] denotes 

the Gauss hypergeometric function. Accordingly, we have 
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Using <HB, A j (Zl) p = ^r, and A 2 = A' 2 , A 3 = A' 3 , we get dH). 
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D. Proof of Theorem 2 

Referring to the analysis of Theorem Q] we have 
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For j — 1, the distance between the reference user and its closest interfering cache-enabled user 
can be as close as 0. The Laplace transform Cj l \x^ P~ l {e t — 1)] can be computed as 
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where a and a are variables as small as 0 and Z 2 (a) = (e 4 —1) p 2-^1 [1,1 — 2 — — a 2 ]. 
Plugging (l47l) . (l22l) and (l50l) into (l49l) . we obtain the average ergodic rate of the reference user 
in m based on Zj= 2 )* = and Ej =2 w (= iSfe- ■ 


E. Proof of Joint PDF of X x , Yj 


The joint probability of 0 < Xi < x,0 < Yj < y,(y > (g>x) is 
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F. Proof of Theorem 3 

In this Case, (l44l ) can be rewritten as 


r»00 pOO 

U:ij = 1 / EsinRj 

'0 Jo 


ln(l + SINRj(y)) x , y f Xl ,y 3 (x, y)dxdy 


(53) 













29 


Similar to (1451) . we have 


E. 


SINR, 


ln(l + SINRj (y)) 


x : y 


P 


9j ,o > y^Pj 1 I r {e t - 1) X,y d t, (54) 


where 


P 


9j,o > y^P^W - l) 


x,y 




\c,, [/P/V-l)]. (55) 


i= 1 


Similarly, the Laplace transform of /* for i = 2, 3 is 
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Considering cr 2 —> 0, we have 
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where Z 3 (e f — 1) = ^^x ^ 2 -fi[1,1 — ■§; 2 — (1 — e*)x ^]. By using a change of variables 
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Based on P 7 ] } P 3) k = Q 3 J, (l59l) turns to (l27l) and the proof is finished. 
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G. Proof of Theorem 4 

Accordingly, it is easy to obtain 
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//. Proof of Theorem 5 


According to (1281) . the average outage probability is given by 
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Similar to (l50l) . the Laplace transform of the interference resulted from the first tier becomes 
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We then have the theorem. 
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I. Proof of Theorem 6 
Accordingly, we have 
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